Glottal Excitation Feature based Gender Identification System using Ergodic HMM
Abstract
Through a series of experimental studies, this paper demonstrates that the time-varying glottal excitation component of speech can be exploited for text-independent gender recognition. The linear prediction (LP) residual is used to represent the excitation information in speech. Gender-specific information in the excitation of voiced speech is captured using hidden Markov models (HMMs). The decrease in error during training, together with a recognition accuracy close to 100% during testing, demonstrates that the excitation component of speech contains gender-specific information and that a continuous ergodic HMM captures it effectively. We evaluate how the number of HMM states, the number of mixture components, and the size of the test data affect gender recognition performance with these gender-specific features. The studies are carried out on the TIMIT database.
Keywords: Gender; Hidden Markov Model (HMM); LPC; MFCC
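As a rough illustration of the LP residual mentioned in the abstract, the sketch below estimates LP coefficients for one frame with the autocorrelation method and subtracts the predicted signal from the frame. It is not the authors' implementation: the function name `lp_residual`, the default order of 10, the small regularization term, and the synthetic test signal are all assumptions made for this example.

```python
import numpy as np


def lp_residual(frame, order=10):
    """Return (coeffs, residual) for one frame (hypothetical helper).

    Uses the autocorrelation method: solve R a = r for the predictor
    coefficients a, then residual[n] = s[n] - sum_k a[k] * s[n-k-1].
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz autocorrelation matrix built from lags 0..order-1
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    # Tiny diagonal loading for numerical stability (an assumption)
    R = R + 1e-9 * r[0] * np.eye(order)
    a = np.linalg.solve(R, r[1:])
    # Predict each sample from the previous `order` samples
    predicted = np.zeros(n)
    for k in range(order):
        predicted[k + 1:] += a[k] * frame[: n - k - 1]
    return a, frame - predicted


# Synthetic "voiced" signal: an impulse train (stand-in for glottal pulses)
# passed through a stable two-pole filter (stand-in for the vocal tract).
excitation = np.zeros(800)
excitation[::80] = 1.0
signal = np.zeros(800)
for i in range(800):
    signal[i] = excitation[i]
    if i >= 1:
        signal[i] += 1.3 * signal[i - 1]
    if i >= 2:
        signal[i] -= 0.8 * signal[i - 2]

coeffs, residual = lp_residual(signal, order=2)
print("estimated LP coefficients:", coeffs)
print("residual / signal energy:", np.sum(residual**2) / np.sum(signal**2))
```

On this toy signal the residual carries far less energy than the signal and is concentrated around the pulse instants, which is the sense in which the residual approximates the excitation. Real speech would additionally need framing, windowing, and a voiced/unvoiced decision, none of which are shown here.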
Similar resources
Performance Evaluation of Statistical Approaches for Text Independent Speaker Recognition Using Source Feature
This paper presents a performance evaluation of statistical approaches for a text-independent speaker recognition system using a source feature. Linear prediction (LP) residual is used as a representation of excitation information in speech. The speaker-specific information in the excitation of voiced speech is captured using statistical approaches such as Gaussian Mixture Models (GMMs) and Hid...
HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
This paper describes an HMM-based speech synthesis system that utilizes glottal inverse filtering for generating natural sounding synthetic speech. In the proposed system, speech is first parametrized into spectral and excitation features using a glottal inverse filtering based method. The parameters are fed into an HMM system for training and then generated from the trained HMM according to te...
Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech
This paper presents a robust feature extraction method that is effective for speech signals with a high fundamental frequency and/or corrupted by additive white noise. The method represents the glottal source wave using an HMM in order to model its nonstationary properties. The nodes of the HMM are concatenated in a ring to represent the periodicity of voiced sounds. The method can accurately extract glotta...
Performance Analysis of Text To Speech Synthesis System Using HMM And Prosody Features With Parsing For Tamil Language
This paper describes a Hidden Markov Model (HMM) based text-to-speech (TTS) system and a prosody-based TTS system for producing natural-sounding synthetic speech in the Tamil language. The HMM-based system consists of two phases: training and synthesis. Tamil speech is first parameterized into spectral and excitation features using Glottal Inverse Filtering (GIF). An emotions present in the input text is...
The GlottHMM Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation
This paper describes the GlottHMM speech synthesis system for Blizzard Challenge 2011. GlottHMM is a hidden Markov model (HMM) based speech synthesis system that uses glottal inverse filtering to separate the vocal tract and the glottal source from the speech signal and models the two components individually. In this year’s entry, stabilized weighted linear prediction (SWLP) is used to yield mo...